Abstract

We present a measure for characterizing statistical relationships between two time 
sequences. In contrast to commonly used measures like cross-correlations, coher- 
ence and mutual information, the proposed measure is non-symmetric and provides 
information about the direction of interdependence. It is closely related to recent 
attempts to detect generalized synchronization. However, we do not assume a strict 
functional relationship between the two time sequences and try to define the mea- 
sure so as to be robust against noise, and to detect also weak interdependences. 
We apply our measure to intracranially recorded electroencephalograms of patients 
suffering from severe epilepsies. 
1 Introduction



In recent years the analysis of synchronization phenomena has received
increasing attention. Such phenomena occur in nearly all sciences, including
physics, astrophysics, chemistry, and even economics. Probably the most
important applications are in biology and medical sciences. In living systems,
synchronization is often essential in normal functioning, while abnormal syn- 
chronization can lead to severe disorders. Typical examples are from neuro- 
sciences, where synchronization under normal conditions seems to be essential 
for the binding problem [1-4], whereas epilepsies are related to abnormally 
strong synchronization. 

Synchronization can manifest itself in different ways. At one extreme are cou- 
pled identical deterministic chaotic systems, which can synchronize perfectly: 
once the coupling exceeds a critical value, both systems move along identical 
orbits [5,6]. If the coupled systems are not identical, they cannot in general
move along identical orbits. If they are both chaotic and noise-free, a strict
relationship can still exist, provided the coupling is sufficiently strong. Let us
denote by $X = (x_1, \ldots, x_N)$ and $Y = (y_1, \ldots, y_N)$ two time sequences from
which state vectors $\mathbf{x}_n$ and $\mathbf{y}_n$ can be reconstructed, e.g., as delay vectors.
Let us also assume that one of the systems, say X, is driving the other. By
this we mean that the evolution of $\mathbf{x}_n$ is autonomous, while $\mathbf{y}_{n+1}$ is a function
of $\mathbf{y}_n$, $\mathbf{x}_n$, and possibly of some external noise [7]. If there is no noise, and
if the driving is non-singular, $\mathbf{y}_{n+1} = \mathbf{F}(\mathbf{x}_n, \mathbf{y}_n)$ with
$\det(\partial F_i / \partial x_{n,k}) \neq 0$, this
relationship can always be inverted (at least locally) and can be written as
$\mathbf{x}_n = \mathbf{G}(\mathbf{y}_n, \mathbf{y}_{n+1})$ or, after possibly increasing the embedding dimension of
Y, as $\mathbf{x}_n = \Phi(\mathbf{y}_n)$ [8]. The opposite relation (possibly with some time shift k)

$$\mathbf{y}_n = \Psi(\mathbf{x}_{n-k}) \qquad (1)$$

is not guaranteed, although it looks a priori more natural in view of the fact
that X is assumed to drive Y. If eq.(1) holds for some finite k, i.e. if the state
of the driven system is a unique function of the driver's state, this is referred to 
as 'generalized synchronization' [9]. Here two cases have to be distinguished: 
strong generalized synchronization corresponds to smooth functions while 
weak generalized synchronization can lead to functions which may even be 
nowhere continuous [10,11]. In the latter case it might be difficult to detect 
synchronization by observing X and Y, while it is immediately seen when 
comparing two realizations $Y^{(a)}$ and $Y^{(b)}$ of the response system: if both are
unique functions of the same X, then obviously $Y^{(a)} = Y^{(b)}$, i.e. they
synchronize perfectly.

Notice that this notion of 'generalized synchronization' is closer to the notion 
of interdependence, rather than to a mere time shift generating temporal co- 
incidences (this is what the word synchronization actually means). If a soften- 
ing of the concept of synchronization is accepted in this way, this 'generalized 
synchronization' is clearly not yet the weakest and most general form of
synchronization. The weakest form is given just when X and Y, considered as
stochastic processes, are not independent. The problem of finding weak effects
of synchronization is thus equivalent to finding weak interdependences. This is
particularly true for a system as complex as, e.g., the brain, where the question
whether eq.(1) holds might be meaningless.

Driver/response asymmetries, as mentioned in the above example, are indeed
quite common also in stochastic systems. Distinguishing the driver from the
responder is of course one of the central goals, particularly in medicine where
it is of utmost importance to detect causal relationships. Unfortunately, no
general method exists to detect such relationships unambiguously. Even if Y
follows the motion of X with a time delay as in eq.(1), so that Y could hardly
drive X, this does not prove that X drives Y. Both systems might be driven
by an unobserved third system Z.

In particular, eq.(1) by itself does not imply that X drives Y. This is obvious
in cases where $\Psi$ is bijective, i.e. where also $\Psi^{-1}$ is unique. If $\Psi$ is not
bijective (which, as we have seen, actually happens if Y drives X but fails
to synchronize it), then, in general, there are several states of X which map 
onto a single state of Y. This will typically happen if the state space of X is 
larger than that of Y. For practical applications where strict equality cannot 
be observed but only closeness, this means that X has a larger attractor 
dimension (i.e. more effective degrees of freedom) than Y. But this does not 
imply any causal relationship. 

Typical observables used for detecting interdependences and synchronization 
are mutual information and cross correlations. Closely related to cross corre- 
lations are cross spectra. The main disadvantage of the latter two is that they 
measure only linear dependences. Causal relationships can (with the above 
caveats) be tested using time delays, i.e. by comparing $\langle x_m y_n \rangle$ with $\langle x_n y_m \rangle$.
Mutual information is sensitive to all kinds of dependencies (it is zero only if 
X and Y are strictly independent), but its estimation imposes quite substan- 
tial requirements on the amount and quality of the data. In particular, if the 
suspected optimal embedding dimension is high, these requirements might be 
hard to meet. Finally, cross correlations and mutual information are symmet- 
ric in X and Y, so that causal relationships can be detected only if they are 
associated with time delays. A priori, causal relationships might exist without 
detectable delays and, as we have pointed out, there might exist delays which 
do not reflect the naively expected causal relationship. 

A new class of asymmetric interdependence measures which might overcome 
some of these limitations has been proposed recently [9,8,12]. These authors
have assumed that a deterministic relationship as in eq.(1) exists, and have
therefore not optimized their observables so as to reliably detect weak
interdependences in a noisy environment. Moreover, they assumed that eq.(1)
automatically implies a causal relationship. That this is not unproblematic 
was discussed above. It is also seen from the fact that the authors of [8] and 
[12] drew exactly the opposite conclusions from mutual predictabilities of X 
and Y. Equation (1) was interpreted in [8] as indicating that Y is the driver 
and X the response, and that Y can be better predicted from X than vice 
versa. The opposite interpretation — namely that the response can be better 
predicted from the driver — was given in [12]. Nevertheless, these observables 
have been applied successfully to neurophysiological problems [8,12].

In the present paper we present another interdependence measure following 
closely references [9,8,12]. But we do not assume eq.(1), and we try to make
our definition as robust as possible. Our observable, together with several
alternatives, is defined in the next section. Applications to EEG signals
recorded from electrodes implanted under the skull of patients suffering from
severe epilepsies are presented in Sec. 3, while our conclusions are drawn in
Sec. 4.



2 Outline of the Method 



Let $X = (x_1, x_2, \ldots, x_N)$ and $Y = (y_1, y_2, \ldots, y_N)$ denote two different
simultaneously observed time sequences. Typically, they will be measurements of
different observables of the same complex system, or measurements taken at 
different positions of a spatially extended system. The internal dynamics of 
the system is not known. In particular, it is not known whether the system 
is deterministic or stochastic, but we are mostly interested in cases where the
latter is more likely a priori, or where it is at least unlikely that the attractor
dimension is so low that methods developed specifically for chaotic
deterministic systems would be applicable. Physical time is related to the index n of
$x_n$, respectively $y_n$, by $t = t_0 + \epsilon n$.

Time-delay embedding [13] in an m-dimensional phase space leads to phase-space
vectors $\mathbf{x}_n = (x_n, \ldots, x_{n-(m-1)\tau})$ and $\mathbf{y}_n = (y_n, \ldots, y_{n-(m-1)\tau})$. The
delay $\tau$ can be chosen as 1, but for oversampled sequences it might be useful
to use some integer $\tau > 1$. To simplify notation, we assume that also values
$x_{2-m}, \ldots, x_0$ and $y_{2-m}, \ldots, y_0$ are given, so that all delay vectors with index
$1 \leq n \leq N$ can be formed, and the time sequences of delay vectors have N
elements each. The arrays of all delay vectors will be denoted
$\mathbf{X} = (\mathbf{x}_1, \ldots, \mathbf{x}_N)$ and $\mathbf{Y} = (\mathbf{y}_1, \ldots, \mathbf{y}_N)$.
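
The embedding step can be sketched in a few lines of code. The following is a
minimal illustration in Python with NumPy, not the implementation used for the
results reported here. The function name `delay_embed` is hypothetical and,
unlike the convention adopted in the text, the sketch simply drops the first
$(m-1)\tau$ indices instead of assuming that earlier samples are available.

```python
import numpy as np

def delay_embed(x, m, tau=1):
    """Return the delay vectors (x_n, x_{n-tau}, ..., x_{n-(m-1)tau}) as rows.

    Only indices n for which all m components exist are kept, so the
    result has len(x) - (m - 1) * tau rows.
    """
    x = np.asarray(x, dtype=float)
    span = (m - 1) * tau
    if len(x) <= span:
        raise ValueError("time series too short for this embedding")
    # Column j holds the samples delayed by j * tau.
    return np.column_stack(
        [x[span - j * tau : len(x) - j * tau] for j in range(m)]
    )

# Toy input: the samples 0, 1, ..., 9 embedded with m = 3 and tau = 2.
x = np.arange(10.0)
vectors = delay_embed(x, m=3, tau=2)
```

Each row of `vectors` is one delay vector, with the most recent sample in the
first column, matching the ordering of components used in the text.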

Let $r_{n,j}$ and $s_{n,j}$, $j = 1, \ldots, k$, denote the time indices of the k nearest
neighbours of $\mathbf{x}_n$ and $\mathbf{y}_n$, respectively. Thus, the first neighbour distances from
$\mathbf{x}_n$ are $d(\mathbf{X})_n^{(1)} = \|\mathbf{x}_n - \mathbf{x}_{r_{n,1}}\| = \min_{q \neq n} \|\mathbf{x}_n - \mathbf{x}_q\|$,
$d(\mathbf{X})_n^{(2)} = \|\mathbf{x}_n - \mathbf{x}_{r_{n,2}}\| = \min_{q \neq n, r_{n,1}} \|\mathbf{x}_n - \mathbf{x}_q\|$, etc.,
where $\|\mathbf{x} - \mathbf{x}'\|$ is the Euclidean distance in delay
space, and similarly for $\mathbf{y}_n$. For each $\mathbf{x}_n$, the mean squared Euclidean distance
to its k closest neighbours is defined as

$$R_n^{(k)}(\mathbf{X}) = \frac{1}{k} \sum_{j=1}^{k} \left( \mathbf{x}_n - \mathbf{x}_{r_{n,j}} \right)^2 \qquad (2)$$

while the conditional mean squared Euclidean distance, conditioned on the
closest neighbour times in the time series Y, is

$$R_n^{(k)}(\mathbf{X}|\mathbf{Y}) = \frac{1}{k} \sum_{j=1}^{k} \left( \mathbf{x}_n - \mathbf{x}_{s_{n,j}} \right)^2 \qquad (3)$$

Notice that the only difference between these two is that we used the 'wrong'
time indices for the neighbours in eq.(3). Instead of summing over nearest
neighbours, we sum over those points whose equal time partners are nearest
neighbours of $\mathbf{y}_n$. Similarly we define

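$$R_n^{(k)}(\mathbf{Y}) = \frac{1}{k} \sum_{j=1}^{k} \left( \mathbf{y}_n - \mathbf{y}_{s_{n,j}} \right)^2 \qquad (4)$$

and

$$R_n^{(k)}(\mathbf{Y}|\mathbf{X}) = \frac{1}{k} \sum_{j=1}^{k} \left( \mathbf{y}_n - \mathbf{y}_{r_{n,j}} \right)^2 \qquad (5)$$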
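
The four distances of eqs.(2)-(5) differ only in which set of neighbour
indices enters the sum. A minimal sketch in Python with NumPy makes this
explicit. It is an illustration under stated assumptions rather than the code
used in this study: the helper names `knn_indices` and `mean_sq_dist` are
hypothetical, the neighbour search is brute force, and random Gaussian vectors
stand in for real delay vectors.

```python
import numpy as np

def knn_indices(vectors, k):
    """Time indices of the k nearest neighbours of each vector (brute force)."""
    diff = vectors[:, None, :] - vectors[None, :, :]
    dist2 = np.sum(diff ** 2, axis=-1)      # squared Euclidean distances
    np.fill_diagonal(dist2, np.inf)         # a point is not its own neighbour
    return np.argsort(dist2, axis=1)[:, :k]

def mean_sq_dist(vectors, idx):
    """(1/k) * sum_j |v_n - v_{idx[n, j]}|^2 for every n."""
    diff = vectors[:, None, :] - vectors[idx]
    return np.mean(np.sum(diff ** 2, axis=-1), axis=1)

rng = np.random.default_rng(0)
X = rng.standard_normal((200, 3))   # stand-in for the delay vectors x_n
Y = rng.standard_normal((200, 3))   # stand-in for the delay vectors y_n
k = 5

r = knn_indices(X, k)               # r[n, j]: neighbours of x_n
s = knn_indices(Y, k)               # s[n, j]: neighbours of y_n

R_X = mean_sq_dist(X, r)            # eq.(2)
R_X_given_Y = mean_sq_dist(X, s)    # eq.(3): same sum, 'wrong' indices
R_Y = mean_sq_dist(Y, s)            # eq.(4)
R_Y_given_X = mean_sq_dist(Y, r)    # eq.(5)
```

Because `r` selects the k smallest distances from each x_n, the conditioned
distance can never be smaller than the unconditioned one, whatever the data.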

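If the point cloud $\{\mathbf{x}_n\}$ has average squared radius
$R(\mathbf{X}) = \langle R_n^{(N-1)}(\mathbf{X}) \rangle$ and
effective dimension D (for a stochastic time series embedded in m dimensions,
D = m), then $R_n^{(k)}(\mathbf{X}) / R(\mathbf{X}) \sim (k/N)^{2/D} \ll 1$ for $k \ll N$. The same is true
for $R_n^{(k)}(\mathbf{X}|\mathbf{Y})$ if X and Y are perfectly correlated, i.e. if there is a smooth
mapping $\mathbf{x}_n = \Psi(\mathbf{y}_n)$. On the other hand, if X and Y are completely
independent, then $R_n^{(k)}(\mathbf{X}|\mathbf{Y}) \gg R_n^{(k)}(\mathbf{X})$. Accordingly, we introduce local and global
interdependence measures $S_n^{(k)}(\mathbf{X}|\mathbf{Y})$ and $S^{(k)}(\mathbf{X}|\mathbf{Y})$ as

$$S_n^{(k)}(\mathbf{X}|\mathbf{Y}) \equiv \frac{R_n^{(k)}(\mathbf{X})}{R_n^{(k)}(\mathbf{X}|\mathbf{Y})} \qquad (6)$$

and

$$S^{(k)}(\mathbf{X}|\mathbf{Y}) \equiv \frac{1}{N} \sum_{n=1}^{N} S_n^{(k)}(\mathbf{X}|\mathbf{Y})
= \frac{1}{N} \sum_{n=1}^{N} \frac{R_n^{(k)}(\mathbf{X})}{R_n^{(k)}(\mathbf{X}|\mathbf{Y})}. \qquad (7)$$

Since $R_n^{(k)}(\mathbf{X}|\mathbf{Y}) \geq R_n^{(k)}(\mathbf{X})$ by construction, we have

$$0 < S^{(k)}(\mathbf{X}|\mathbf{Y}) \leq 1. \qquad (8)$$

If $S^{(k)}(\mathbf{X}|\mathbf{Y}) \approx (k/N)^{2/D} \ll 1$, then obviously X and Y are independent
within the limits of accuracy. If, however, $S^{(k)}(\mathbf{X}|\mathbf{Y}) \gg (k/N)^{2/D}$, we say
that X depends on Y, without thereby implying any causal relationship. This
dependence becomes maximal when $S^{(k)}(\mathbf{X}|\mathbf{Y}) \to 1$.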
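
The limiting cases discussed above can be checked numerically. The sketch below
(Python with NumPy) is only an illustration under stated assumptions: the
function name `interdependence` is hypothetical, the neighbour search is brute
force, and synthetic Gaussian vectors replace real delay vectors. It evaluates
the global measure for an independent partner, for a partner that is a smooth
invertible function of X, and for X conditioned on itself.

```python
import numpy as np

def interdependence(X, Y, k):
    """Global measure S^(k)(X|Y): mean over n of R_n(X) / R_n(X|Y)."""
    def neighbours(V):
        d2 = np.sum((V[:, None, :] - V[None, :, :]) ** 2, axis=-1)
        np.fill_diagonal(d2, np.inf)          # exclude the point itself
        return np.argsort(d2, axis=1)[:, :k]

    def mean_sq_dist(V, idx):
        return np.mean(np.sum((V[:, None, :] - V[idx]) ** 2, axis=-1), axis=1)

    R_X = mean_sq_dist(X, neighbours(X))          # neighbours of x_n
    R_X_given_Y = mean_sq_dist(X, neighbours(Y))  # neighbours of y_n
    return float(np.mean(R_X / R_X_given_Y))      # average of local ratios

rng = np.random.default_rng(1)
N, k = 300, 10
X = rng.standard_normal((N, 3))
Y_indep = rng.standard_normal((N, 3))   # independent of X
Y_func = np.tanh(X)                     # smooth, invertible function of X

S_indep = interdependence(X, Y_indep, k)
S_func = interdependence(X, Y_func, k)
S_self = interdependence(X, X, k)       # identical neighbour sets
```

Conditioning X on itself reproduces the same neighbour sets, so `S_self`
equals one, while the independent partner gives the smallest of the three values.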

The opposite dependences $S_n^{(k)}(\mathbf{Y}|\mathbf{X})$ and $S^{(k)}(\mathbf{Y}|\mathbf{X})$ are defined in complete
analogy. They are in general not equal to $S_n^{(k)}(\mathbf{X}|\mathbf{Y})$ and $S^{(k)}(\mathbf{X}|\mathbf{Y})$. Both
$S^{(k)}(\mathbf{X}|\mathbf{Y})$ and $S^{(k)}(\mathbf{Y}|\mathbf{X})$ may be of order 1. Therefore X can depend on Y,
and at the same time Y can depend on X. If $S^{(k)}(\mathbf{X}|\mathbf{Y}) > S^{(k)}(\mathbf{Y}|\mathbf{X})$, i.e. if X
depends more on Y than vice versa, we say that Y is more "active" than X.
Again we do not imply this to have any causal meaning, a priori. An important
question is whether an active/passive relationship, as defined in this way, has
a causal driver/response interpretation in certain circumstances.

In order to understand the origin of active/passive relationships, we consider 
the simple case where both time sequences are identical, X = Y, but we use 
different embedding dimensions $m_X$ and $m_Y$ in the delay vector construction.
More precisely, we take $m_X < m_Y$ and $m_X < m_{\mathrm{opt}}$, where $m_{\mathrm{opt}}$ is an optimal
embedding dimension in the sense that for $m < m_{\mathrm{opt}}$ the point cloud $\{\mathbf{x}_n\}$
is not completely unfolded, while it is unfolded for $m \geq m_{\mathrm{opt}}$. Thus each $\mathbf{x}_n$
can be considered as a singular projection of $\mathbf{y}_n$, $\mathbf{x}_n = \Psi(\mathbf{y}_n)$, with non-unique
inverse $\Psi^{-1}$. Assume now that $\mathbf{y}_s$ is a close neighbour of $\mathbf{y}_n$. Then $\mathbf{x}_s$ must also
be a close neighbour of $\mathbf{x}_n$. But the opposite is not true: closeness in x space
does not imply closeness in y space. Therefore, conditioning on times s where
$\mathbf{y}_s$ are close neighbours of $\mathbf{y}_n$ has less effect for neighbours of $\mathbf{x}_n$ than vice
versa, and $S^{(k)}(\mathbf{X}|\mathbf{Y}) > S^{(k)}(\mathbf{Y}|\mathbf{X})$. Although this is not a mathematically
rigorous argument, it shows clearly that the active/passive relationship, as
defined above, mainly reflects the relative number of degrees of freedom and
not a driver/response relationship. Systems with many degrees of freedom
(high dimensional "attractors") are more active than those with few.
Notice, however, that $S^{(k)}$ is sensitive only to those degrees of freedom which
are excited with amplitudes of order $R^{(k)}$. The latter depends, among other things,
on k and on N. The tendency of (weakly) coupled systems to have degrees of
freedom which are excited with very small amplitudes is well known [14,15].
It often leads to wrong estimates of attractor dimensions, and it can make the
observable active/passive relationship depend on parameters such as k and
N [16]. It might be responsible for the contradictory results of [8,17,12].

Before leaving this section, we point out several possible generalizations and 
alternatives. 

(a) Using the same Euclidean distance to define neighbours and in the sums in
eqs.(2)-(5) is not necessary. Instead of the geometrical distance, in eqs.(2)-(5)
we could have used any other dissimilarity measure between $\mathbf{x}_n$ resp. $\mathbf{y}_n$ and
the point clouds $\{\mathbf{x}_{r_{n,j}}\}$ etc. If we had used forecasting errors in local
forecasts based on these clouds, we would have arrived at interdependence
measures very similar to those of [8,17]. In [8], 'zero time step' forecasting
was also studied. This is most closely related to our observables, but it uses only
the distance between $\mathbf{x}_n$ and the center of mass of the point cloud
$\{\mathbf{x}_{s_{n,j}}, j = 1, \ldots, k\}$, while we use all distances $|\mathbf{x}_n - \mathbf{x}_{s_{n,j}}|$ individually. It is clear that the
latter contains more information, and should therefore be more sensitive.

(b) Instead of using arithmetic averages as in eqs.(2)-(5) and (7), we could 
have used geometric or harmonic averages. And we could have replaced the 
average of ratios in eq.(7) by a ratio of (arithmetic, geometric, or harmonic) 
averages. Again this could severely change sensitivity and robustness. We have 
not made an exhaustive test of all alternatives, but we checked that the above 
definitions are more robust than several alternatives. For instance, replacing 
eq.(7) by

[Equation (9), defining the alternative measure $S'^{(k)}(\mathbf{X}|\mathbf{Y})$, is not recoverable from the source text.]



gave much noisier results in the applications discussed in the next section,
which were also much harder to interpret physiologically. This is easily
understood. In S', occasional very small values of $R_n^{(k)}(\mathbf{X})$ have much more influence
than in S. Such small values are obtained if $\mathbf{x}_n$ depends abnormally weakly on
Y, which might arise from some perturbation acting at time n. Thus S is more
robust against shot noise than S'. We found similar results when using
harmonic averages in eqs.(2)-(5). The main difference between the present paper
and [9] is that these authors were interested in the case of noiseless
deterministic attractors and strong interdependences where these considerations play no
role, and they therefore did not try to find the most robust observable. Also,
they discussed only the case k = 1. This gives the strongest signal, but it is also
much more strongly affected by noise than k > 1. In the following applications we
used k = 10, which seemed to give the best signal-to-noise ratio (see below).

(c) In eq.(6) we essentially compare the Y-conditioned mean squared distances
to the mean squared nearest neighbour distances. Instead of this, we could
have compared the former to the mean squared distances to random points,
$R_n(\mathbf{X}) = (N-1)^{-1} \sum_{j \neq n} (\mathbf{x}_n - \mathbf{x}_j)^2$. Also, let us use the geometrical average
in the analogue of eq.(7), and define

$$H^{(k)}(\mathbf{X}|\mathbf{Y}) = \frac{1}{N} \sum_{n=1}^{N} \log \frac{R_n(\mathbf{X})}{R_n^{(k)}(\mathbf{X}|\mathbf{Y})} \qquad (10)$$

This is zero if X and Y are completely independent, while it is positive if
nearness in Y implies also nearness in X for equal time partners. It would
be negative if close pairs in Y correspond mainly to distant pairs in X. This
is very unlikely but not impossible. Therefore, $H^{(k)}(\mathbf{X}|\mathbf{Y}) = 0$ suggests that
X and Y are independent, but does not prove it. This (and the asymmetry
under the exchange $X \leftrightarrow Y$) is the main difference between $H^{(k)}(\mathbf{X}|\mathbf{Y})$ and
mutual information. The latter is strictly positive whenever X and Y are not
completely independent. As a consequence, mutual information is quadratic in
the correlation $P(X, Y) - P(X)P(Y)$ for weak correlations (P are here
probability distributions), while $H^{(k)}(\mathbf{X}|\mathbf{Y})$ is linear. This might make $H^{(k)}(\mathbf{X}|\mathbf{Y})$
useful in applications.
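
The alternative measure of alternative (c) can be sketched in the same way.
The code below (Python with NumPy) is a hedged illustration, not the authors'
implementation: the function name `h_measure` is hypothetical, the natural
logarithm is assumed, and synthetic Gaussian vectors are used in place of
delay vectors.

```python
import numpy as np

def h_measure(X, Y, k):
    """Mean over n of log( R_n(X) / R_n^(k)(X|Y) )."""
    N = len(X)
    d2_X = np.sum((X[:, None, :] - X[None, :, :]) ** 2, axis=-1)
    d2_Y = np.sum((Y[:, None, :] - Y[None, :, :]) ** 2, axis=-1)
    # R_n(X): mean squared distance to all other points (diagonal terms are 0).
    R_rand = d2_X.sum(axis=1) / (N - 1)
    np.fill_diagonal(d2_Y, np.inf)
    s = np.argsort(d2_Y, axis=1)[:, :k]          # neighbours of y_n
    # R_n^(k)(X|Y): distances in X space at the neighbour times of y_n.
    R_cond = np.take_along_axis(d2_X, s, axis=1).mean(axis=1)
    return float(np.mean(np.log(R_rand / R_cond)))

rng = np.random.default_rng(3)
X = rng.standard_normal((300, 3))
Y_indep = rng.standard_normal((300, 3))

H_indep = h_measure(X, Y_indep, k=10)   # expected to be close to zero
H_self = h_measure(X, X, k=10)          # largest possible value for this X
```

Conditioning on the true neighbours of X gives the smallest conditioned
distances, so `H_self` is an upper bound for any partner series at the same k.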

(d) Instead of eq.(3) we could have defined the time shifted generalization

[Equation (11), defining this time-shifted conditional distance, is missing from the source text.]

with some (positive or negative) integer l. The idea behind this definition is
that it is not clear a priori that $\mathbf{x}_n$ is most closely related to the simultaneous
vector $\mathbf{y}_n$. Rather, if there are some time delays in generating either $\mathbf{x}_n$ or
$\mathbf{y}_n$, the 'natural' partner of $\mathbf{x}_n$ might be $\mathbf{y}_{n+l}$. In this way we can introduce
a further element of asymmetry which could give additional hints on causal
relationships.

(e) Up to now, we have assumed in general that we use the same embedding
for X and for Y. This is not necessary, and we could have used a different
embedding dimension m and a different delay $\tau$ for Y. We did not follow this
path since X and Y had similar characteristics in the examples studied in the next
section. But it is worthwhile to point out that we can use our
interdependence measure for pairs of time series with completely different characteristics
(amplitudes, spectra, etc.). Dependence does not imply similarity in any sense!

(f) Instead of the Euclidean distance we could have used any other distance
in defining neighbourhoods, e.g. the maximum norm.
3 Application

3.1 Data Acquisition 

We analyzed electroencephalographic signals (EEG) that were recorded in
patients suffering from pharmacoresistant focal epilepsies. In these patients
freedom from seizures can be obtained by resecting the part of the brain responsible
for seizure generation. Recording such data is a mandatory part of the
presurgical analysis. The recording electrodes are left in the brain for typically
2 to 3 weeks. During this time the patients are also monitored by video, so that
EEG activity can be matched with behavior, and seizures can be identified 
from either. The analyses reported here were made after surgery had taken 
place, and after it had become clear from its success whether the localization 
of the epileptic focus had been correctly predicted. 

EEG was recorded from electrodes implanted under the skull, hence close to 
the epileptic focus and with high signal-to-noise ratio. In particular, we used 
two types of electrodes: rectangular flexible grids of 8 x 8 contacts placed onto 
the cortex, and pairs of needle shaped depth electrodes with 10 contacts each, 
implanted into deeper structures of the brain (see fig. 1). 

EEG signals were sampled at 173 Hz using a 12 bit analog-to-digital (A/D) 
converter and filtered within a frequency band of 0.53 to 40 Hz. The cutoff 
frequency of the lowpass filter was selected to suppress possible contamination 
by the power line. For more details on the data and recording techniques, see 
[18,19] and references given therein. The data sets analyzed in this study had 
a duration of 10 minutes each (cut out from much longer sequences) and were 
divided into segments of T seconds each. Neighbours were searched only within 
the same segment.

3.2 Parameter Selection 

As is well known, details of the delay embedding such as the choice of embedding
dimension m and delay $\tau$ can be very important. In principle, the theorems
of Takens [13] and Sauer et al. [20] state that results should not depend on
them if data are noiseless and N is arbitrarily large, but in practice they do.
Many methods have been proposed to find "optimal" parameter values.
However, appropriate choices of m and $\tau$ strongly depend on specific aspects
of the problem at hand (such as noise level, type of noise, intermittency,
stationarity, etc.). Thus general recipes which do not take these
factors into account can be misleading. This holds true in particular for estimates of m
based on false nearest neighbours [21]. One of the most popular recipes [22]
for determining the optimal delay $\tau$ is based on minimizing the mutual
information in a two-dimensional embedding. But in general the same $\tau$ does not
minimize the mutual information in an embedding in $m \geq 3$ dimensions [23].
The same comment applies to estimates of $\tau$ from the first zero of the
autocorrelation function. Therefore we used none of these a priori estimates of
"optimal" embedding parameters in this study. Instead, we approached the
problem empirically by calculating $S^{(k)}(\mathbf{X}|\mathbf{Y})$ and $S^{(k)}(\mathbf{Y}|\mathbf{X})$ for different
values of m, $\tau$, T, and k. In addition, we also applied a Theiler correction [24]
by restricting the nearest neighbour times $r_{n,j}$ and $s_{n,j}$ to
$|n - r_{n,j}| > \tau_{\mathrm{Theiler}}$ and $|n - s_{n,j}| > \tau_{\mathrm{Theiler}}$,
and tested several values for $\tau_{\mathrm{Theiler}}$. It is of course not
feasible to make a systematic search over all possible combinations of these
parameters, but we feel sure that our final choices are reasonable and not too far
from the optimum. We made these optimizations out of sample, i.e. we used 
a well understood 'training' data set where we could judge the reasonability 
of our observables by comparing with the medical diagnosis. This training set
was not used as test set in any of the subsequent analyses. The "optimal"
parameters are m = 10 (embedding dimension), $\tau$ = 5 (delay in units of
sampling time), k = 10 (neighbourhood size), T = 10 (segment length in seconds),
and $\tau_{\mathrm{Theiler}}$ = 10. Indeed, somewhat better results were in some cases obtained
with larger k (up to k = 100), but we stuck to the above because it was faster
without too much loss of significance. The delay $\tau$ = 5 was implemented by
simply decimating the time sequences, thereby effectively reducing the
sampling rate from 173 Hz to 34.6 Hz. Thus, each segment contained 346 delay
vectors.
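
The segment length and the Theiler correction can be illustrated with a short
sketch in Python with NumPy. It is an assumption-laden toy example rather than
the analysis code: the function name `knn_theiler` is hypothetical, a random
walk replaces the EEG, and a three-dimensional unit-delay embedding is used
only to keep the example small.

```python
import numpy as np

def knn_theiler(vectors, k, w):
    """k nearest neighbours of each vector, skipping times with |n - q| <= w."""
    n_vec = len(vectors)
    d2 = np.sum((vectors[:, None, :] - vectors[None, :, :]) ** 2, axis=-1)
    t = np.arange(n_vec)
    # Exclude temporally close points (this also removes q == n).
    d2[np.abs(t[:, None] - t[None, :]) <= w] = np.inf
    return np.argsort(d2, axis=1)[:, :k]

fs = 173.0        # sampling rate in Hz (from the text)
tau = 5           # delay, implemented as decimation by a factor of 5
T = 10            # segment length in seconds
vectors_per_segment = round(T * fs / tau)   # samples left after decimation

rng = np.random.default_rng(2)
walk = np.cumsum(rng.standard_normal(vectors_per_segment))  # autocorrelated toy signal
V = np.column_stack([walk[2:], walk[1:-1], walk[:-2]])      # m = 3, unit delay
idx = knn_theiler(V, k=10, w=10)
gaps = np.abs(idx - np.arange(len(V))[:, None])             # time separations
```

For a strongly autocorrelated signal such as this random walk, the nearest
neighbours without the exclusion window would mostly be temporal neighbours,
which is the effect the correction is meant to remove.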

3.3 Data Representation 
3.3.1 Depth Electrodes

From the 20 time sequences recorded via the depth electrodes 400
combinations have to be analyzed. Results can be arranged into a 20 x 20
interdependence matrix $S_{ij} = S^{(k)}(\mathbf{X}_i|\mathbf{X}_j)$. We present our results graphically by
encoding each pixel in a 20 x 20 array using a grey scale. Pixel (i, j) is black
if $S_{ij} = 1$ ($\mathbf{X}_i$ and $\mathbf{X}_j$ are identical; this happens on the diagonal), while it
is white if $S_{ij} = 0$. The numbering of channels and their arrangement in the
matrix are explained in fig. 2.

Quadrants I and IV represent interdependences between signals from the same
(left resp. right) hemisphere, while quadrants II and III show interdependences
between different hemispheres. More precisely, if a pixel (i, j) in quadrant II
is darker than its partner (j, i) in quadrant III, the region around contact i in
the right hemisphere is more active than the region around contact j in the
left hemisphere. Of particular interest are also average values of $S_{ij}$, i.e.
averaged over a region symmetric under reflection along the diagonal. The average
darkness of such a region is a direct measure of its average interdependences
with other parts of the brain involved.

A typical example of a grey scale pattern is shown in fig. 3 exhibiting two 
regions of high interdependence in both the left hemisphere and the right 
hemisphere. In this case the depth electrodes were not placed in a completely 
symmetrical fashion. While the electrode in the left hemisphere had 4 contacts 
in the entorhinal cortex and 6 contacts in the hippocampus, the right electrode 
had 3 contacts in the entorhinal cortex and 7 in the hippocampus. This dif- 
ference (confirmed by MRI images) is clearly seen in fig. 3. In addition, there 
is a stronger interdependence between entorhinal cortex and hippocampus on 
the left than on the right side, and the left hippocampus can be assumed to 
be more active than the right one. Interpretations of the latter will be given 
in sec. 3.4.1. 

3.3.2 Grid Electrodes 

Since grid electrodes consisted of 64 contacts, it is not very practical to rep- 
resent the data in the same way as for the depth electrodes. In addition, 
labeling the contacts by means of a single index will result in a loss of all 
neighbourhood information, and the patterns would be hard to interpret. A 
different representation is obtained by displaying each contact as a plaquette 
of an 8 x 8 matrix, and indicating the activity patterns by arrows connecting 
these plaquettes [17,12]. But such a picture (which is optimal for a small
number of electrodes) is also too densely packed with information to be
useful for our present applications.

We proceeded differently. We first averaged all 60 matrices obtained by
cutting the 10-minute recording into intervals of 10 seconds. The resulting
time-averaged interdependences are called $S^{(k)}(\mathbf{X}_{i_1 i_2}|\mathbf{X}_{j_1 j_2})$, where $(i_1, i_2)$ and
$(j_1, j_2)$ are the coordinates of the contacts. We next perform a ranking of all
entries in the 64 x 64 matrix except the elements on the diagonal. Using the
highest one percent of entries after ranking and taking the lower end as a
cutoff $S_c$, we define for each contact $(i_1, i_2)$ an average activity

$$A_{i_1 i_2} = \sum_{j_1, j_2} S^{(k)}(\mathbf{X}_{j_1 j_2}|\mathbf{X}_{i_1 i_2}) \,
\Theta\!\left( S^{(k)}(\mathbf{X}_{j_1 j_2}|\mathbf{X}_{i_1 i_2}) - S_c \right) \qquad (12)$$

and an average passivity

$$P_{i_1 i_2} = \sum_{j_1, j_2} S^{(k)}(\mathbf{X}_{i_1 i_2}|\mathbf{X}_{j_1 j_2}) \,
\Theta\!\left( S^{(k)}(\mathbf{X}_{i_1 i_2}|\mathbf{X}_{j_1 j_2}) - S_c \right). \qquad (13)$$

The cutoff $S_c$ is introduced in order to eliminate the effect of contact pairs
with very weak interdependence. For these pairs, $S^{(k)}(\mathbf{X}_i|\mathbf{X}_j)$ is dominated by
noise, and including them would mainly decrease the signal-to-noise ratio.

Using the coordinates $i_1$ and $i_2$ we can finally represent $A_{i_1 i_2}$ and $P_{i_1 i_2}$ as 8 x 8
grey scale matrices. Alternatively, we can add them and represent the sum
$A_{i_1 i_2} + P_{i_1 i_2}$ as a grey scale matrix. An example is given in fig. 4, exhibiting
a region with very strong interdependence near the lower right corner. Its
interpretation will be given in the next section.
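
The thresholding and summation behind the activity and passivity maps can be
sketched as follows (Python with NumPy). This is a minimal illustration under
several assumptions that are not taken from the text: the function name
`activity_passivity` is hypothetical, the matrix convention is
`S[a, b]` = S(X_a|X_b), entries equal to the cutoff are kept, contact
$(i_1, i_2)$ is mapped to the flat index $8 i_1 + i_2$, and a random matrix
stands in for the time-averaged interdependences.

```python
import numpy as np

def activity_passivity(S, top_fraction=0.01):
    """Thresholded column sums (activity) and row sums (passivity) of S."""
    S = np.asarray(S, dtype=float)
    n = S.shape[0]
    off_diag = ~np.eye(n, dtype=bool)
    ranked = np.sort(S[off_diag])[::-1]              # descending ranking
    n_top = max(1, int(round(top_fraction * ranked.size)))
    s_c = ranked[n_top - 1]                          # lower end of the top entries
    kept = np.where(off_diag & (S >= s_c), S, 0.0)   # step-function cutoff
    activity = kept.sum(axis=0)    # column i: how strongly others depend on i
    passivity = kept.sum(axis=1)   # row i: how strongly i depends on others
    return activity, passivity, s_c

rng = np.random.default_rng(4)
S_demo = rng.uniform(0.0, 1.0, size=(64, 64))        # stand-in for averaged S
A, P, s_c = activity_passivity(S_demo)

A_grid = A.reshape(8, 8)           # grey scale map of activity
P_grid = P.reshape(8, 8)           # grey scale map of passivity
sum_grid = A_grid + P_grid         # combined map
```

With 64 contacts there are 4032 off-diagonal entries, so the top one percent
corresponds to roughly 40 contact pairs entering the sums.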

3.4 Results 

Our results are illustrated by three examples covering lateralization of the focal 
brain side, precise focus localization in neocortical epilepsies, and changes 
of interdependences before an impending seizure. These examples are quite 
typical. A more systematic study involving statistically significant samples is 
under way and will be presented elsewhere. 
3.4.1 First Example



We analyzed 10 minutes of an interictal (seizure-free interval) EEG of a patient 
suffering from a so called mesial temporal lobe epilepsy. The clinical workup 
suggested the epileptic focus to be located in the left hemisphere of the brain. 
We divided the EEG data set into 60 nonoverlapping consecutive 10-second
segments and calculated a 20 x 20 S-matrix for each segment as described 
above. One of these matrices was already shown in fig. 3. This figure is typical 
for all 60 matrices in showing more interdependences in the left hemisphere 
than in the right. This concerns both interdependences within the hippocam- 
pus, and between hippocampus and adjacent cortex. Indeed, surgery on the 
left side resulted in complete seizure control of this patient. This suggests that 
our proposed measure might be able to lateralize the focal side of the brain. 

3.4.2 Second Example

We analyzed 10 minutes of interictal EEG data from a patient suffering from a 
neocortical lesional epilepsy. In this case an 8 x 8 grid electrode was implanted 
covering the underlying brain lesion. Again the data set was subdivided as in 
example one. 

A typical activity-passivity matrix obtained by means of the procedure de- 
scribed in sec. 3.3.2 is shown in fig. 4. As already pointed out in sec. 3.3.2, 
we observed highest interdependences in regions near the lower right corner. 
Indeed, the patient was operated on exactly in this region (which had been 
identified during presurgical evaluation) and is now free of seizures. 
3.4.3 Third Example



In contrast to the aforementioned examples, where we used only EEG
recordings from a seizure-free interval and averaged the data over time, we now study
S as a function of time. Our time resolution is again T = 10 sec. Of particular
interest are changes of S before an impending seizure, as this could finally
lead to its prediction [25,26] (see footnote 1). But changes during seizures and during
the postictal (after-seizure) period are also of interest.

A sequence of interdependence patterns taken before, during and after a 
seizure is shown in fig. 5. The pattern of interdependences within the right 
hemisphere remains almost constant, even during the course of the seizure. 
On the other hand, S-values of the left hemisphere change dramatically. As
confirmed by successful surgery, the left hemisphere was the focal side in this
case. During the preictal stage, S decreases from a high initial level to almost
zero. Notice that S is also very low in quadrant II directly before seizure
onset, indicating that the left hemisphere is much less active. In frame #13,
shortly before the onset of the seizure, interdependence builds up again on 
the left side. It reaches its maximum during the seizure and finally declines 
towards the interictal level. 

This coincides with findings of Lehnertz and Elger [25] who found reduced 
complexity before an impending seizure. Notice that "activity" according to 
our definition essentially depends on the number of excited degrees of freedom, 
which is exactly what was measured in [25]. The loss of activity before the 
seizure onset can be interpreted as a more or less hidden pathological
synchronization phenomenon. It is assumed that seizure activity will be induced
when a "critical mass" of neurons is progressively involved in closely
time-linked high-frequency discharging. This critical mass might be reached if the
preceding level of synchronization decreases, enabling neurons to establish a
synchronization which is high enough to finally lead to seizure activity.

1 The results of [26] use a vague definition of the interictal period and might
therefore be questionable.

At first sight it may therefore seem paradoxical that interdependences decrease 
before a seizure. But this might indeed be exactly what happens. In a healthy 
brain a critical mass is never reached because neurons are strongly tied into 
networks where they communicate with others. A critical stage may be reached 
when a large population is "idle" and therefore on the one hand uncorrelated 
with the rest of the network, but on the other hand, easily recruitable for 
subsequent coherent pathophysiological activity. 

4 Discussion 

We have presented an observable which can detect dependences between si- 
multaneously measured time sequences. It is similar to other synchronization 
measures proposed recently, but is somewhat simpler and more robust. With 
the other measures it shares the property of being asymmetric. In principle, 
this asymmetry suggests that our measure can indicate causal relationships, 
which might be useful for identifying the driver among the two subsystems 
emitting the sequences. We claim that such information might be obtainable in 
principle, but the interpretation is subtle and naive arguments can be quite misleading. 
Nevertheless, this asymmetry is very interesting. It mainly depends on the 
difference in 'activity' which measures the effective number of excited degrees 
of freedom. This effective number of active degrees of freedom depends on the 
scales to which the observable is most sensitive. In principle, in an asymmetric 
driver-response pair the attractor dimension of the response is always at least 
as high as that of the driver (if both are deterministic), but this might be 
relevant only at length scales which are too small to be resolved practically. 

Our measure could also be used to detect generalized synchronization, but 
we do not assume in our applications that the signals are chaotic with low 
dimensions. In contrast to recent attempts to detect phase synchronization 
in brain signals, our measure does not treat phase information differently from 
amplitude information, and thus we cannot discuss phase or frequency locking. 
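
The asymmetry discussed above can be illustrated with a short numerical sketch. It assumes that S(X|Y) is the average, over all delay vectors, of the mean squared distance to the k nearest neighbours in X divided by the mean squared distance to the points that are the k nearest neighbours in Y. The embedding dimension `m`, delay `tau`, neighbour number `k` and Theiler window `theiler` are illustrative values chosen here, not those used for the EEG data, and the test signals are synthetic noise.

```python
import numpy as np

def delay_embed(x, m, tau):
    """Return delay vectors (x[n], x[n+tau], ..., x[n+(m-1)tau]) as rows."""
    n_vec = len(x) - (m - 1) * tau
    return np.column_stack([x[i * tau : i * tau + n_vec] for i in range(m)])

def sq_dists(v, theiler):
    """Pairwise squared distances; temporally close pairs are excluded."""
    d = ((v[:, None, :] - v[None, :, :]) ** 2).sum(axis=-1)
    idx = np.arange(len(v))
    d[np.abs(idx[:, None] - idx[None, :]) <= theiler] = np.inf
    return d

def interdependence(x, y, m=3, tau=1, k=5, theiler=5):
    """S(X|Y): mean over n of R_n(X) / R_n(X|Y) (assumed definition)."""
    dx = sq_dists(delay_embed(np.asarray(x, float), m, tau), theiler)
    dy = sq_dists(delay_embed(np.asarray(y, float), m, tau), theiler)
    nn_x = np.argsort(dx, axis=1)[:, :k]  # k nearest neighbours in X
    nn_y = np.argsort(dy, axis=1)[:, :k]  # k nearest neighbours in Y
    r_x = np.take_along_axis(dx, nn_x, axis=1).mean(axis=1)
    r_x_given_y = np.take_along_axis(dx, nn_y, axis=1).mean(axis=1)
    return float(np.mean(r_x / r_x_given_y))

rng = np.random.default_rng(0)
x = rng.standard_normal(400)
y_indep = rng.standard_normal(400)                # unrelated to x
y_coupled = x + 0.1 * rng.standard_normal(400)    # noisy copy of x

s_indep = interdependence(x, y_indep)
s_coupled = interdependence(x, y_coupled)
# The two directions are computed separately and need not agree.
s_xy = interdependence(x, y_coupled)
s_yx = interdependence(y_coupled, x)
```

By construction the ratio lies between 0 and 1, and it equals 1 when both sequences share the same neighbours. Any difference between `s_xy` and `s_yx` should be read with the caution expressed above, since it does not by itself identify the driver.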

We applied our measure to intracranial multichannel EEG recordings taken 
from patients suffering from severe epilepsies. We found significant depen- 
dences between different recording sites, and these dependences were in general 
not symmetric. Owing to the careful preoperative screening of these patients 
and their postoperative observation, we could compare our results in detail 
with other neurophysiological findings. The most interesting preliminary 
results are the following: 

1) During seizure-free intervals, the seizure generating area of the brain ex- 
hibited higher interdependences than other brain areas. 

2) Some seizures analyzed here were preceded by short periods (30 s to several 
minutes) during which extremely low dependences were confined to the seizure 
generating area. 

Although these results are very encouraging, a more systematic study is needed 
and is under way. In addition, a host of further investigations is imaginable. 
Obvious candidates are the influences of drugs, the effect of mental activity 
(epilepsy patients behave normally even with implanted electrodes), or of 
various stimuli. Another important problem is the determination of the 'critical 
mass' of neurons needed to trigger a seizure. Moreover, a more systematic 
comparison with other diagnostic tools is necessary beforehand. Finally, the 
present findings already suggest a number of physiological results whose 
interpretation demands a thorough theoretical study. For instance, it is a priori 
not clear whether a seizure is primarily triggered by a change of activity in 
the seizure generating area, or a change of susceptibility of the surrounding 
regions. We hope that the near future will show progress along these lines. 

Figure captions: 



Fig. 1: Schematic view of the two types of intracranial electrodes used in this 
paper. Grids were placed onto the cortex and have either 8x8 electrodes. 
Needle shaped depth electrodes have ten contacts each and were always used 
pairwise in a left-right symmetrical fashion. In some cases, depth electrodes 
and grids were used together. 

Fig. 2: Scheme of subdivision of the 20x20 matrix $S_{ij}$. The indices $L_1$ to 
$L_{10}$ denote the contacts on the left depth electrode, from innermost ($L_1$) to 
outermost ($L_{10}$). Similarly, $R_1$ to $R_{10}$ correspond to the right depth electrode. 
The index i runs horizontally, while j runs vertically. E.g., quadrant II shows 
the effect of conditioning right hemispheric channels on the channels from left 
hemisphere. 

Fig. 3: Example of a 20x20 S-matrix of a 10 sec. segment recorded during 
the seizure-free interval using 10 depth electrodes on each side of the brain. 

Fig. 4: (A) Average activity pattern in an 8 x 8 grid electrode; (B) average 
passivity and (C) normalized sum of both. 

Fig. 5: Sequence of interdependence patterns including preictal (1-14), ictal 
(15-16) and postictal (17-20) brain electrical activity. 

Figure 1 